Raw Sequence Listing Error Summary 



ERROR DETECTED SUGGESTED CORRECTION 



SERIAL NUMBER 



ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH "ALPHA" HEADERS, WHICH WERE INSERTED Br PTO SOFTWARE 



1 Wrapped Nucletcs 



Wrapped Aminos 



Incorrect Line Length 



The nunnberAext at the end of each fine "Wrapped* down to the next line. 
This may occur if your file was retrieved in a word processor after creating it. 
Please adjust your right margin to .3, as this will prevent "Vrapping". 

The amino acid numt>er/lcxt at the end of each line "Wrapped " down to the next line. 
This may occur if your file was retrieved in a word processor after creating it. 
Please adjust your right margin to .3. as this will prevent "Wapping'. 

The rules require that a line not exceed 72 characters in length. This includes spaces. 
An text must t>e visible on page. 



Misaligned Amino Add , 
Numbering 

Non-ASCII 



Variable Length 



Wrong Designation 



8 Skipped Sequences 

(OLD RULES) 



10 



Skipped Sequences 
(NEW RULES) 



Use of N'sorXaa's 
(NEW RULES) 



The numbering under each 5th amino ackJ is misaligned. This may be caused by the use of Xzhy 
t>etween the numbering. It is reconnmended to delete any tabs and uses spacing between the numbers. 

This file was not saved in ASCII (DOS) text as required by the Sequence Rules. 

Please ensure your subsequent submissk>n is saved in ASCII text so that it can be processed. 

Sequence(s) contain n*s or Xaa's which represented nwro than one residue. 

As per the rules, each n orXaa can only represent a single rescue. 

Please present the maximum number of each residue having variable length and 

indicate in the (ix) features section that some may be missing. 

Sequencc(s) contain amino acid or nucleic acid designators which are not standard 

representations as per the Sequence Rules (Please refer to paragraph 1 .822) 

Sequencc{s) missing. If intentional, please use the following format for each skipped sequence: 

(2) INFORMATION FOR SEQ ID NO:X: 

(I) SEQUENCE CHARACTERISTICS:(Do not insert any headings under "SEQUENCE CHARACTERISTICST 
(xl) SEQUENCE DESCRIPTION:SEQ ID NO:X: 
This sequence Is Intentionally skipped 

Please also adjust the "(iii*) NUMBER OF SEQUENCES;* response to include the skipped sequence(s). 

Sequence(s) missing. If Intentional, please use the following formal for each skipped sequence. 

<210> sequence Id number 
<400> sequence Id number 
000 

Use of N's and/or Xaa's have been detected In the Sequence Listing. 
Use of <220> to <223> is MANDATORY if n's or Xaa's are prescnL 



1 1 Use of <213>Organism Sequence(s) . 

(NEW RULES) 



, are missing this mandatory fieM or its response. 



12 Use of <220>Feature 

(NEW RULES) 



Sequence(s) are missing the <220>Feature and associated headings. 

Use of <220> to <223> is MANDATORY if <213>ORGANISM Is •ArtifidaP or "Unknown" 

(See -Federal Register." 6/01/98. Vd. 63. No. 104. pp. 29631-32) 
(Sec. 1.823 of new Sequence Rules) 



13 



Wrong F mriai' Re sut>mitted was In the atphat>etk:al heading format of the OW Sequence Rules.Thls b InvalW since th 

"Requirements for Patent Applk:atk>ns ConUlnlng NucleotkSe Sequence and/or Amin Acid Disctosures" 
Federal Register Notice. Vol, 63, No. 104. June 1. 1995, p. 29620 
applies to applications filed on or after July 1. 1998. 

AKS-Biotechnology Systems Branch- 7/10i/9d 



I 

I 

1615 



PAGE; 



RAW SEQUENCE LISTING 

PATENT APPLICATION US/09/144,838 



DATE: 12/01/1998 
TIME: 12:57:03 



Input Set: 1144838. RAW 



w--> 
w--> 
w--> 
w--> 
w--> 



w--> 
w--> 
w--> 
w--> 



1 

2 
3 
4 
5 
6 
7 
8 
9 
10 
11 
12 
13 
14 
15 
16 
17 
18 
19 
20 
21 
22 
23 
24 
25 
26 
27 
28 
29 
30 
31 
32 
33 
34 
35 
36 
37 
38 
39 
40 
41 
42 
43 
44 



This Raw Listing contains the General Information 
Section and up to first 5 pages. 



<110> APPLICANT: Siani, Michael A 
Wilken, Jill 
S imon , Reyna 
Kent, Stephen B.H. 

<120> TITLE OF INVENTION: Modular Protein Libraries and Methods of Preparation 
<13 0> FILE REFERENCE: GRFN20/O1US 

<140> CURRENT APPLICATION NUMBER: US/09/144 , 83 8 
<141> CURRENT FILING DATE: 1998-08-31 
<150> EARLIER APPLICATION NUMBER: US 60/057,620 
<151> EARLIER FILING DATE: 1997-09-04 
<160> NUMBER OF SEQ ID NOS : 54 
<170> SOFTWARE: Patent In Ver. 2.0 
<210> SEQ ID NO 1 
<211> LENGTH: 81 
<212> TYPE: PRT 

<213> ORGANISM: Artificial Sequence 
<220> FEATURE: 

<223> OTHER INFORMATION: Description of Artificial Sequence : synthetic 

<400>^-SEQ^JENefrs— 1 j/l^ 

Xaa Xaa Xaa Xaa Xaa Xaa XaaTxaai Cys ^Xaa/ Cys Leu lie 



Corr2?^^ Not cr. 



^ Cys^Xa^ Cys Leu lie Val MetQca^ ^^^X^/^ /O 

(^^ JC^aXaa Xaa Xaa Xaa ^eu lie Val Met Phe Tyr Xa^ Arg Lys Ser £aAj^\J 



lie Val 





lu Gin Leu 

60 



lie Val 



<210> 
<211> 
<212> 
<213> 
<220> 
<223> 
<400> 



aa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa/Cys Leu Asp 
5^1 75 80 



Met 

65 
Asn 

SEQ ID NO 2 
LENGTH: 82 
TYPE : PRT 



ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: Description of Artificial Sequence : synthetic 
SEQUENCE: 2 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Cys Cys Leu lie Val Met Phe 

15 10 15 

Tyr Thr Xaa Xaa Xaa Xaa Xaa Xaa Leu lie Val Met Xaa Xaa Xaa Xaa 

20 25 30 

Leu lie Val Met Phe Xaa Xaa Tyr Xaa Xaa Xaa Gly Ser Thr Asn Gly 

35 40 45 

Ser Thr Asn Xaa Xaa Cys Xaa Xaa Xaa Xaa Ser Ala Gly Leu lie Val 



1615 
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PATENT APPLICATION US/09/144,838 TIME: 12:57:03 

Input Set: 1144838. RAW 



45 






50 




55 






60 






46 




Met 


Xaa Xaa 


Phe 


Leu Xaa Xaa Xaa Xaa 


Xaa 


Arg 


Lys 


Thr 


Met Phe Xaa 


47 




65 






70 




75 






80 


48 




Xaa 


Cys 
















49 


<210> 


SEQ 


ID NO 3 
















50 


<211> 


LENGTH: 71 
















51 


<212> 


TYPE : PRT 
















52 


<213> 


ORGANISM: Artificial Sequence 












53 


<220> 


FEATURE : 
















54 


<223> 


OTHER INFORMATION: Description of Artificial Sequence : synthetic 


55 


<400> 


SEQUENCE : 3 
















56 




Ala 


Gly Ser 


Leu 


Val Ser Tyr Thr Pro 


Asn 


Ser 


Cys 


Cys 


Tyr Gly Phe 


57 




1 






5 


10 








15 


58 




Gin 


Gin His 


Pro 


Pro Pro Val Gin lie 


Leu 


Lys 


Glu 


Trp 


Tyr Pro Thr 


59 








20 


25 










30 


60 




Ser 


Pro Ala 


Cys 


Pro Lys Pro Gly Val 


He 


Leu 


Leu 


Thr Lys Arg Gly 


61 






35 




40 








45 




62 




Arg 


Gin lie 


Cys 


Ala Asp Pro Ser Lys 


Asn 


Trp 


Val 


Arg 


Gin Leu Met 


63 






50 




55 






60 






64 




Gin 


Arg Leu 


Pro 


Ala lie Ala 












65 




65 






70 












66 


<210> 


SEQ 


ID NO 4 
















67 


<211> 


LENGTH: 74 
















68 


<212> 


TYPE : PRT 
















69 


<213> 


ORGANISM: Artificial Sequence 












70 


<220> 


FEATURE : 
















71 


<223> 


OTHER INFORMATION: Description of Artificial Sequence : synthetic 


72 


<400> 


SEQUENCE: 4 
















73 




Gly 


Asp Thr 


Leu 


Gly Ala Ser Trp His 


Arg 


Pro 


Asp 


Lys 


Cys Cys Leu 


74 




1 






5 


10 








15 


75 




Gly 


Tyr Gin 


Lys 


Arg Pro Leu Pro Gin 


Val 


Leu 


Leu 


Ser 


Ser Trp Tyr 


76 








20 


25 










30 


77 




Pro 


Thr Ser 


Gin 


Leu Cys Ser Lys Pro Gly Val 


He 


Phe 


Leu Thr Lys 


78 






35 




40 








45 




79 




Arg 


Gly Arg 


Gin 


Val Cys Ala Asp Lys 


Ser 


Lys 


Asp 


Trp 


Val Lys Lys 


80 






50 




55 






60 






81 




Leu 


Met Gin 


Gin 


Leu Pro Val Thr Ala 


Arg 










82 




65 






70 












83 


<210> 


SEQ 


ID NO 5 
















84 


<211> 


LENGTH: 72 
















85 


<212> 


TYPE : PRT 
















86 


<213> 


ORGANISM: Artificial Sequence 












87 


<220> 


FEATURE : 
















88 


<223> 


OTHER INFORMATION; Description of Artificial Sequence : synthetic 


89 


<400> 


SEQUENCE: 5 
















90 




Ala Gly Ser 


Leu 


Val Ser Tyr Thr Pro 


Asn 


Ser 


Cys 


Cys 


Tyr Gly Phe 


91 




1 






5 


10 








15 


92 




Gin 


Gin His 


Pro 


Pro Pro Val Gin lie 


Leu 


Lys 


Glu 


Trp 


Tyr Pro Thr 


93 








20 


25 










30 


94 




Ser 


Pro Ala 


Cys 


Ser Lys Pro Gly Val 


He 


Phe 


Leu 


Thr 


Lys Arg Gly 



i 
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Input Set: 1144838. RAW 



95 






35 




40 






45 








96 




Arg 


Gin Val 


Cys 


Ala Asp Lys Ser 


Lys Asp Trp Val 


Lys 


Lys 


Leu 


Met 


97 






50 




55 




60 










98 




Gin 


Gin Leu 


Pro 


Val Thr Ala Arg 














99 




65 






70 














100 


<210> 


SEQ 


ID NO 6 


















101 


<211> 


LENGTH: 73 


















102 


<212> 


TYPE: PRT 


















103 


<213> 


ORGANISM: Artificial Sequence 














104 


<220> 


FEATURE : 


















105 


<223> 


OTHER INFORMATION: Description of Artificial Sequence : synthetic 


106 


<400> 


SEQUENCE: 6 


















107 




Gly 


Asp Thr 


Leu 


Gly Ala Ser Trp 


His Arg Pro 


Asp 


Lys 


Cys 


Cys 


Leu 


108 




1 






5 


10 








15 




109 




Gly 


Tyr Gin 


Lys 


Arg Pro Leu Pro 


Gin Val Leu 


Leu 


Ser 


Ser 


Trp 


Tyr 


110 








20 




25 






30 






111 




Pro 


Thr Ser 


Gin 


Leu Cys Pro Lys 


Pro Gly Val 


He 


Leu 


Leu 


Thr 


Lys 


112 






35 




40 






45 








113 




Arg 


Gly Arg 


Gin 


He Cys Ala Asp 


Pro Ser Lys 


Asn 


Trp 


Val 


Arg 


Gin 


114 






50 




55 




60 










115 




Leu 


Met Gin 


Arg 


Leu Pro Ala He 


Ala 












116 




65 






70 














117 


<210> 


SEQ 


ID NO 7 


















118 


<211> 


LENGTH: 93 


















119 


<212> 


TYPE : PRT 


















120 


<213> 


ORGANISM: Artificial Sequence 














121 


<220> 


FEATURE : 


















122 


<223> 


OTHER INFORMATION: Description of Artificial Sequence : synthetic 


123 


<400> 


SEQUENCE: 7 


















124 




Met 


Asn Ala 


Lys 


Val Val Val Val 


Leu Val Leu 


Val 


Leu 


Thr 


Ala 


Leu 


125 




1 






5 


10 








15 




126 




Cys 


Leu Ser 


Asp 


Gly Lys Pro Val 


Ser Leu Ser 


Tyr 


Arg 


Cys 


Pro 


Cys 


127 








20 




25 






30 






128 




Arg 


Phe Phe 


Glu 


Ser His Val Ala 


Arg Ala Asn 


Val 


Lys 


His 


Leu 


Lys 


129 






35 




40 






45 








130 




He 


Leu Asn 


Thr 


Pro Asn Cys Ala 


Leu Gin He 


Val 


Ala 


Arg 


Leu 


Lys 


131 






50 




55 




60 










132 




Asn 


Asn Asn 


Arg 


Gin Val Cys He 


Asp Pro Lys 


Leu 


Lys 


Trp 


He 


Gin 


133 




65 






70 


75 










80 


134 




Glu 


Tyr Leu 


Glu 


Lys Ala Leu Asn 


Lys Arg Phe 


Lys 


Met 








135 










85 


90 












136 


<210> 


SEQ 


ID NO 8 


















137 


<211> 


LENGTH: 67 


















138 


<212> 


TYPE: PRT 


















139 


<213> 


ORGANISM: Artificial Sequence 














140 


<220> 


FEATURE : 


















141 


<223> 


OTHER INFORMATION: Description of Artificial 


Sequence : synthetic 


142 


<400> 


SEQUENCE: 8 


















143 




Lys 


Pro Val 


Ser 


Leu Ser Tyr Arg 


Cys Pro Cys 


Arg 


Phe 


Phe 


Glu 


Ser 


144 




1 






5 


10 








15 





i 
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Input Set: 1144838. RAW 



145 




His 


Val Ala 


Arg 


Ala Asn Val Lys 


His 


Leu 


Lys 


He 


Leu 


Asn 


Thr Pro 


146 








20 




25 










30 




147 




Asn 


Cys Ala 


Leu 


Gin He Val Ala 


Arg 


Leu 


Lys 


Asn 


Asn 


Asn 


Arg Gin 


148 






35 




40 










45 






149 




Val 


Cys lie 


Asp 


Pro Lys Leu Lys 


Trp 


He 


Gin 


Glu 


Tyr 


Leu 


Glu Lys 


150 






50 




55 








60 








151 




Ala 


Leu Asn 




















152 




65 






















153 


<210> 


SEQ 


ID NO 9 




















154 


<211> 


LENGTH: 91 




















155 


<212> 


TYPE: PRT 




















156 


<213> 


ORGANISM: Artificial Sequence 
















157 


<220> 


FEATURE : 




















158 


<223> 


OTHER INFORMATION: Description of Artificial Sequence : synthetic 


159 


<400> 


SEQUENCE: 9 




















160 




Met 


Lys Val 


Ser 


Ala Ala Arg Leu 


Ala 


Val 


He 


Leu 


He 


Ala 


Thr Ala 


161 




1 






5 




10 










15 


162 




Leu 


Cys Ala 


Pro 


Ala Ser Ala Ser 


Pro 


Tyr 


Ser 


Ser 


Asp 


Thr 


Thr Pro 


163 








20 




25 










30 




164 




Cys 


Cys Phe 


Ala 


Tyr He Ala Arg 


Pro 


Leu 


Pro 


Arg 


Ala 


His 


He Lys 


165 






35 




40 










45 






166 




Glu 


Tyr Phe 


Tyr 


Thr Ser Gly Lys 


Cys 


Ser 


Asn 


Pro 


Ala 


Val 


Val Phe 


167 






50 




55 








60 








168 




Val 


Thr Arg 


Lys 


Asn Arg Gin Val 


Cys 


Ala 


Asn 


Pro 


Glu 


Lys 


Lys Trp 


169 




65 






70 






75 








80 


170 




Val 


Arg Glu 


Tyr 


He Asn Ser Leu 


Glu 


Met 


Ser 










171 










85 




90 












172 


<210> 


SEQ 


ID NO 10 


















173 


<211> 


LENGTH: 68 




















174 


<212> 


TYPE: PRT 




















175 


<213> 


ORGANISM: Artificial Sequence 
















176 


<220> 


FEATURE : 




















177 


<223> 


OTHER INFORMATION: Description of Artificial 


. Sequence : synthetic 


178 


<400> 


SEQUENCE: 10 


















179 




Ser 


Pro Tyr 


Ser 


Ser Asp Thr Thr 


Pro 


Cys 


Cys 


Phe 


Ala 


Tyr 


He Ala 


180 




1 






5 




10 










15 


181 




Arg 


Pro Leu 


Pro 


Arg Ala His He 


Lys 


Glu 


Tyr 


Phe 


Tyr 


Thr 


Ser Gly 


182 








20 




25 










30 




183 




Lys 


Cys Ser 


Asn 


Pro Ala Val Val 


Phe 


Val 


Thr 


Arg 


Lys 


Asn 


Arg Gin 


184 






35 




40 










45 






185 




Val 


Cys Ala 


Asn 


Pro Glu Lys Lys 


Trp 


Val 


Arg 


Glu 


Tyr 


He 


Asn Ser 


186 






50 




55 








60 








187 




Leu 


Glu Met 


Ser 


















188 




65 






















189 


<210> 


SEQ 


ID NO 11 


















190 


<211> 


LENGTH: 33 




















191 


<212> 


TYPE : PRT 




















192 


<213> 


ORGANISM: Artificial Sequence 
















193 


<220> 


FEATURE : 




















194 


<223> 


OTHER INFORMATION: Description of Artificial 


Sequence : synthetic 



i 
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PAGE: 5 RAW SEQUENCE LISTING DATE: 12/01/1998 

PATENT APPLICATION US/09/144,838 TIME: 12:57:03 

Input Set: 1144838. RAW 

195 <400> SEQUENCE: 11 

196 Lys Pro Val Ser Leu Ser Tyr Arg Cys Pro Cys Arg Phe Phe Glu Ser 

197 15 10 15 

198 His Val Ala Arg Ala Asn Val Lys His Leu Lys lie Leu Asn Thr Pro 

199 20 25 30 

200 Asn 

201 <210> SEQ ID NO 12 

202 <211> LENGTH: 32 

203 <212> TYPE: PRT 

204 <213> ORGANISM: Artificial Sequence 

205 <220> FEATURE: 

206 <223> OTHER INFORMATION: Description of Artificial Sequence : synthetic 

207 <400> SEQUENCE: 12 

2 08 Lys Pro Val Ser Leu Ser Tyr Arg Cys Cys Arg Phe Phe Glu Ser His 

209 1 5 10 15 

210 Val Ala Arg Ala Asn Val Lys His Leu Lys lie Leu Asn Thr Pro Asn 

211 20 25 30 

212 <210> SEQ ID NO 13 

213 <211> LENGTH: 33 

214 <212> TYPE: PRT 

215 <213> ORGANISM: Artificial Sequence 

216 <220> FEATURE: 

217 <223> OTHER INFORMATION: Description of Artificial Sequence : synthetic 

218 <400> SEQUENCE: 13 

219 Lys Pro Val Ser Leu Ser Tyr Arg Cys Pro Cys Phe Ala Tyr lie Ala 

220 1 5 10 15 

221 Arg Pro Leu Pro Arg Ala His lie Lys Glu Tyr Phe Tyr Thr Ser Gly 

222 20 25 30 

223 Lys 

224 <210> SEQ ID NO 14 

225 <211> LENGTH: 32 

226 <212> TYPE: PRT 

227 <213> ORGANISM: Artificial Sequence 

228 <220> FEATURE: 

229 <223> OTHER INFORMATION: Description of Artificial Sequence : synthetic 
23 0 <400> SEQUENCE: 14 

231 Lys Pro Val Ser Leu Ser Tyr Arg Cys Cys Phe Ala Tyr lie Ala Arg 

232 15 10 15 

23 3 Pro Leu Pro Arg Ala His lie Lys Glu Tyr Phe Tyr Thr Ser Gly Lys 

234 20 25 30 

235 <210> SEQ ID NO 15 

236 <211> LENGTH: 33 

237 <212> TYPE: PRT 

238 <213> ORGANISM: Artificial Sequence 

239 <220> FEATURE: 

24 0 <223> OTHER INFORMATION: Description of Artificial Sequence : synthetic 

241 <400> SEQUENCE: 15 

242 Ser Pro Tyr Ser Ser Asp Thr Thr Pro Cys Cys Phe Ala Tyr lie Ala 

243 15 10 15 

244 Arg Pro Leu Pro Arg Ala His lie Lys Glu Tyr Phe Tyr Thr Ser Gly 



PAGE: ■ '6 VERIFICATION SUMMARY DATE: 12/01/1998 

PATENT APPLICATION US/09/144,838 TIME: 12:57:03 

Input Set: 1144838. RAW 



Line ? Error /Warning 



20 


W 


"N" 


or 


"Xaa" 


used 


Feature 


required 


22 


W 


"N" 


or 


"Xaa" 


used 


Feature 


required 


24 


w 


"N" 


or 


"Xaa " 


used 


Feature 


required 


26 


w 


"N" 


or 


"Xaa " 


used 


Feature 


required 


28 


w 


"N" 


or 


"Xaa" 


used 


Feature 


required 


38 


w 


"N" 


or 


"Xaa" 


used 


Feature 


required 


40 


w 


"N" 


or 


"Xaa" 


used 


Feature 


required 


42 


w 


"N" 


or 


"Xaa" 


used 


Feature 


required 


44 


w 


"N" 


or 


"Xaa" 


used 


Feature 


required 


46 


w 


"N" 


or 


"Xaa" 


used 


Feature 


required 


48 


w 


"N" 


or 


"Xaa" 


used 


Feature 


required 



Original Text 



Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Cys 


Xaa 


C 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Leu 


He 


Val 


Met 


Phe 


T 


Glu 


Gin 


Xaa 


Leu 


He 


Val 


Met 


Xaa 


Xaa 


Leu 


I 


Xaa 


Xaa 


Ser 


Ala 


Gly Xaa 


Xaa 


Cys 


Xaa 


Xaa 


X 


Met 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


X 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Cys 


c 


Tyr 


Thr 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


Leu 


He 


V 


Leu 


He 


Val 


Met 


Phe 


Xaa 


Xaa 


Tyr 


Xaa 


Xaa 


X 


Ser 


Thr 


Asn 


Xaa 


Xaa 


Cys 


Xaa 


Xaa 


Xaa 


Xaa 


s 


Met 


Xaa 


Xaa 


Phe 


Leu 


Xaa 


Xaa 


Xaa 


Xaa 


Xaa 


A 



Xaa Cys 



